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Abstract 

Using representation theoretic work on the Whitehouse module, a formula is obtained for 
the cycle structure of a riffle shuffle followed by a cut. It is proved that the use of cuts does 
not speed up the convergence rate of riffle shuffles to randomness. Type A afflne shuffles are 
compared with riffle shuffles followed by a cut. Although these probability measures on the 
symmetric group S'„ are different, they both satisfy a convolution property. Strong evidence is 
given that when the underlying parameter q satisfies gcdin, q — l) = 1, the induced measures on 
conjugacy classes of the symmetric group coincide. This gives rise to interesting combinatorics 
concerning the modular equidistribution by major index of permutations in a given conjugacy 
class and with a given number of cyclic descents. Generating functions for the first pile size in 
patience sorting from decks with repeated values are derived. This relates to random matrices. 

Key words: card shuffling, conjugacy class, sorting, random matrix, cycle structure, Whitehouse 
module. 

1 Introduction 



In an effort to study the way real people shuffle cards, Bayer and Diaconis [BaD] performed a 



definitive analysis of the Gilbert-Shannon- Reeds model of riffle shuffling. For an integer k> 1, a,k- 

shuffle can be described as follows. Given a deck of n cards, one cuts it into k piles with probability 

( ■ .". ■ ) 
of pile sizes ji , • • • , jk given by ^^\n^ ■ Then cards are dropped from the packets with probability 

proportional to the pile size at a given time (thus if the current pile sizes are Ai , • • • , A^ , the next 



card is dropped from pile i with probability . ,,\a )■ It was proved in [BaD| that ilog2n shuffles 
are necessary and suffice for a 2-shuffle to achieve randomness (the paper H had established this 
result asymptotically in n). It was proved in |DMP| that ii k = q is a, prime power, then the 



chance that a permutation distributed as a (7-shuffle has n^ i-cycles is equal to the probability 
that a uniformly chosen monic degree n polynomial over the field Fg factors into 71^ irreducible 
polynomials of degree i. 

These results have recently been extended to other Coxeter groups and placed in a Lie theoretic 



setting. The paper [BeBe] defines hyperoctaheral shuffles using descent algebras and the paper 



|BiHR| relates Gilbert-Shannon-Reeds shuffles to hyperplane walks. The paper |F2| defines riffle 



shuffling for arbitrary real hyperplane arrangements, with convergence rates obtainable from the 
theory in [ BiHR| ] . The results of [ DMP| ] are given a Lie theoretic formulation and extension, at least 



for types A and B, in |F3| and p^ ]. (Random polynomials are replaced by the semsimple orbits of 



the adjoint action of a finite group of Lie type on its Lie algebra, and even in type A restrictions 
on the characteristic are needed). The paper ^If considers cycle structure of permutations after 
biased shuffles, and generalizations based on dynamical systems appear in [Lal|,| pa^ ]. 



It is worth commenting that the combinatorics of type A riffle shuffles is intimitely connected 
to cyclic and Hochschild homology [ Han[ | , |GerSf| to the Poincare-Birkhoff-Witt theorem [BeW], free 



Lie algebras [Ga], and to Hopf algebras (Section 3.8 of |SSt|). In recent work, Stanley pta| has 



related biased riffle shuffles with representation theory of the symmetric group, thereby giving 
an elementary probabilistic interpretation of Schur functions and a different approach to some 



work in the random matrix community. He recasts and extends results of [BaD] and |F1] using 
quasisymmetric functions. 



Using a construction of Cellini |Cel[| , |Ce2 |, the paper |F5| studies combinatorially much more 



intricate shuffles called afflne shuffles (they are reviewed in Section y). The conjectures of |F5| state 
in type A that the chance that a permutation distributed as an afflne q shuffle has nj z-cycles is 
equal to the probability that a uniformly chosen monic degree n polynomial with constant term 
1 over the field Fq factors into rij irreducible polynomials of degree i (the abstraction of these 
polynomials is semisimple conjugacy classes of finite groups of Lie type). These conjectures are 
remarkable in the sense that (unlike the Lie algebra case [p^ ) , no restrictions on the characteristic 
are needed and there seems to be a reasonably natural way of associating to such a polynomial a 
permutation in the right conjugacy class, such that choosing the polynomial at random induces the 
afflne shuffling measure. As emerges in [p5| ] (which gives an application to dynamical systems and 
hints at number theoretic applications), this conjecture seems challenging. 

The second type of shuffle to be studied in this paper is riffle shuffling followed by a cut at 
a uniformly chosen random position. Section ^ develops combinatorial preliminaries of shuffles 
followed by cuts. It is shown there that doing r "A;-shuffles followed by a cut" is the same as doing 
r /c-shuffles and then a single cut (this is known for k = 2 from | Ce3[ ). It is proved that the 
total variation distance between a sequence of x riffle shuffles and y cuts (performed in any order) 
and the uniform distribution on Sn is at least the total variation distance between a sequence 
of X riffle shuffles on Sn~i and the uniform distribution on Sn-i- In this precise sense cuts do 
not help speed up riffle shuffles. This perhaps surprising fact can be contrasted with a result of 



Diaconis [D2|, who used representation theory to show that although shuffling by doing random 
tranpositions gets random in ^nlog{n) steps, the use of cuts at each stage drops the convergence 
time to ^nlog{n) steps. It would be worthwhile and interesting to systematically study the effect 



of cuts on convergence rates of shuffling methods. Section ^ also shows that a riffle shuffle followed 
by a cut is at least as random (and sometimes moreso) than a cut followed by a riffle shuffle. 

Section Q uses representation theoretic work on the Whitehouse module to obtain a formula for 
the cycle structure of a riffle shuffle followed by a cut. More precisely, it is shown that the chance 
that a permutation distributed as a g riffle shuffle followed by a cut has iii z-cycles is equal to the 
probability that a uniformly chosen monic degree n polynomial with non-zero constant term over 
the field Fq factors into nj irreducible polynomials of degree i. The connection with representation 



theory is illuminating; it follows for instance that the work of | DMP | on the cycle structure of 



riffle shuffles is equivalent to a representation theoretic result of Hanlon [Han|. The Whitehouse 
module is interesting in its own right and appears in many places in mathematics; the interested 
reader should see the transparencies for a talk on the Whitehouse module on Richard Stanley's 
MIT website. 

Section ^ gives strong evidence for the assertion that affine shuffles and shuffles followed by a 
cut, though different probability measures, coincide when lumped according to conjugacy classes 
provided that the prime power q satisfies gcd{n, q — 1) = 1. This leads to fascinating combinatorics 
concerning the modular equidistribution by major index of permutations in a given conjugacy class 
and with a given number of cyclic descents. 

Section P considers questions which turn out to be related to cycle structure in multiset per- 



mutations. To motivate things, we follow the recent preprint |AD| in its description of patience 
sorting (which can also be viewed as a toy model for Solitaire, a card game which unlike Blackjack 
has been extremely difficult to analyze mathematically). The simplest case is that one starts with 
a deck of cards labelled 1, 2, • • • , n in random order. Cards are turned up one at a time and dealt 
into piles on the table according to the rule that a card is placed on the leftmost pile whose top 
card is of higher value. If no such pile exists, the card starts a new pile to the right. For example 
the ordering 

7513624 

leads to the arrangement 

1 

5 2 4 

7 3 6 



The survey paper ||AD[| details connections of patience sorting with ideas ranging from random ma- 
trices and the Robinson-Schensted correspondence to interacting particle systems ( [|AD2 |,[ BaiDeJ [ 



|Han|,[Ilai]). For example the number of piles in patience sorting is the length of the longest in- 



creasing subsequence of a random permutation. After an appropriate renormalization, this statistic 
has the same distribution as the largest eigenvalue of a random GUE matrix. The usefulness of 
I AD] is its insight that other functions of the random shape obtained through patience sorting have 



interesting structure. They give results for various pile sizes and suggest the search for analogous 
results for the following two variations of patience sorting from decks with repeated values: ties 
allowed (i.e. a card may be placed on a card of the same value) or ties forbidden. Section ^ gives 
generating functions for the first pile size for these two variations, and also for the case where each 
card is chosen at random from a finite alphabet. We remark that our approach also works for a 
game one could call rra-Solitaire in which each card value may be placed on a given pile up to m 
times (ties allowed and ties forbidden corresponding to m = oo,m = 1 respectively). As these 
games are of less interest and the calculation is grungier, we omit it. 

It may strike the reader that Section g, though involving cards and cycle structure, is unrelated 
to card shuffling. However, there is a relation. To elaborate, suppose one picks a random word 
of length n from a totally ordered alphabet, where the probability of getting symbol i is Xj. The 
number of piles in patience sorting applied to the word is the length of the longest weakly increasing 
subsequence. Each word corresponds to a possible riffle shuffle and the longest increasing subse- 
quence in the corresponding permutation is the longest weakly increasing subsequence in the word 

To close the introduction, we introduce some terminology that will be used throughout the 
paper. Cr{m) will denote the Ramanujan sum ^^ g^'^*'"*''' where the sum is over all / less than 
and prime to r. An element w in the symmetric group Sn is said to have a descent at position i 
ifl<i<n — 1 and w{i) > w{i + 1). The notation d{w) will denote the number of descents of w. 
The major index of w, denote maj{w) will be the sum of the positions i at which w has a descent. 
The permutation w is said to have a cyclic descent at n if w{n) > 'w(l). Then cd{w) is defined as 
d{w) if w has no cyclic descent at n and as d{w) + 1 if w has a cyclic descent at n. As noted in 
|]Cel| , the concepts of descents and cyclic descents have analogs for all Weyl groups. (The descents 
are the simple positive roots mapped to negative roots and w has a cyclic descent if it keeps the 
highest root positive). 



2 Type A affine shuffles 

This section gives six definitions for wfiat we call type A affine /c-shuffles. The first two are due to 



Cellini Cel | and are the best in that they generalize to all Weyl groups. The next four are due to the 
author [ p5[ | and are very useful for computational purposes. (The final definition is a "physical" 
description of affine type A 2-shuffles. A good problem is to extend the physical description to 
higher values of A;). In all of the definitions k is a positive integer. A seventh definition (which 
is conjectural but very conceptual and potentially valid for all Weyl groups) appears in the final 
section of [p5[| . As it requires some effort to describe and will not be used here, we omit it. 
Definitions of afRne type A fc-shuflfles 

1. Let Wfc be the subgroup of the type A affine Weyl group generated by reflections in the n 
hyperplanes {xi = X2, ■ ■ ■ ,Xn~i = Xn,Xn — xi = k}. This subgroup has index k^^^ in the 
affine Weyl group and has k^~^ unique minimal length coset representatives, each of which 
can be written as the product of a permutation and a translation. Choose one of these k^~^ 
coset representatives uniformly at random and consider its permutation part. Define a type 
A affine /c-shuffie to be the distribution on permutations so obtained. 

2. A type A affine fc-shuffle assigns probability to w^^ G Sn equal to ^„_i multiplied by the 
number of integers vectors (ui, • • • , w„) satisfying the conditions 

(a) vi-\ \-Vn = 

(b) Vi > V2 > ■ ■ ■ > w„, f 1 -Vn<k 

(c) Vi > Wj_|_i if w{i) > w{i + 1) (with 1 < i < n — 1) 

(d) vi < Vn + k ii w{n) > w{l) 

3. A type A affine /c-shuffle assigns probability to w^^ G Sn equal to ^j^j^ multiplied by the 
number of partitions with < n — 1 parts of size at most k — cd{w) such that the total number 
being partitioned has size congruent to —maj{w) mod n. Equivalently, it assigns probability 
equal to -nhn- multiplied by the number of partitions with < k — cd{w) parts of size at most 
n — 1 such that the total number being partitioned has size congruent to —maj{w) mod n. 

4. A type A affine A;-shuffle assigns probability to w~^ G 5„ equal to 



n-\~k — cd{iu)—r 

T[^^r\n,k-cd{w){ fc-cdM )Cr{-maj{w)) if k - cd{w) > 

r 

■^^ \i k — cd{w) = 0, ■maj{w) = mod n 

otherwise. 

5. A type A affine A:-shuffle assigns probability to w^^ G Sn equal to 

^ t Coeff. of a- ^n [f^^^^^ [' + " ^ ^'^^'^ " ] ) , 



r=0 



whre 



(l-g)...(l-gA) 



denotes the o-binomial coefficient T^ — \ — tt — Vttt — t^t — t^-^-- 



6. A type A affine 2 shuffie has the following physical description for the symmetric group S^n- 

Step 1: Choose an even number between 1 and 2n with the probability of getting 2j equal to 
2^-ri.-\ ■ From the stack of 2n cards, form a second pile of size 2j by removing the top j cards 
of the stack, and then putting the bottom j cards of the ffi'st stack on top of them. 

Step 2: Now one has a stack of size 2n — 2j and a stack of size 2j. Drop cards repeatedly 
according to the rule that if stacks 1, 2 have sizes A, B at some time, then the next card comes 
from stack 1 with probability ^ . ^ and from stack 2 with probability ^ , ^ . (This is equivalent 

of the cards in each stack) . 



to choosing uniformly at random one of the (2") inter leavings preserving the relative orders 



The description of X2 is the same for the symmetric group S2n+i, except that at the beginning 



of Step 1, the chance of getting 2j is „j'„ and at the beginning of Step 2, one has a stack 



(2n+l 
1 

of size 2n + 1 — 2j and a stack of size 2j. 



An important property of these shuffies is the so called "convolution property" , which says that 
a k\ shuffie followed by a k^ shuffie is equivalent to a k\ki shuffle. It is interesting that the type A 



riffle shuffies of BaD| satisfy the same convolution property, as do some of the generalizations in 



The conjecture of |F5| that relates the cycle structure of permutations distributed as affine 



g-shuffles to the factorization of polynomials with constant term 1 appears to be interesting. For 
instance it is shown there that for the case of the identity conjugacy class of S'„,, it amounts to the 
?7i = case of following observation in "modular combinatorial reciprocity" . We recently noticed 



that this reciprocity statement appears in an invariant theoretic setting as Hermite reciprocity in 



m 



For any positive integers x, y, the number of ways (disregarding order and allowing repetition) 
of writing m (mod y) as the sum of x integers of the set 0, 1, • • • , y — 1 is equal to the number of 
ways (disregarding order and allowing repetition) of writing m (mod x) as the sum of y integers of 



the sei 0, 1, • • • ,a; — 1 



Let fn,k,d be the coefficient of z^ in {^j-^Y ^'^^ l^t /i be the Moebius function. Let ni{w) be 
the number of i-cycles in a permutation w. Then (loc. cit.) the conjecture is equivalent to the 
truly bizarre assertion (which we intentionally do not simplify) that for all n, A;, 



Y^ Coef. of q-^u-t^ in ^ ^ ^ /^ ^ ^ t^'^(-)g™'^^(-) 1]^^^"^ 



m=0 mod n n=0 ^ ^' ^ ^ ' w^Sn 



. (1 - tg) ... (1 - tq-) 



^ Coef. of q^u^t^ i^ E *' n IKy 



oo oo oo -. 

i = r 

Tib nrt . rf li 



^/*Z]d|,'^('^)/™,fc,»/d 



m=0 mod k—1 k=0 i=lm=l ^ * 

3 Shuffles followed by a cut 



To begin we remark that although [BaD] offers a formula for a shuffle followed by a cut, the formula 
is really for a cut followed by a shuffle, which is different. 

Let s be the element of the group algebra of Sn denoting a /c-riffle shuffle. Let C be the cyclic 
permutation (1 • • • n) and let c = ^ J27=o C". Thus in this notation a shuffle followed by a cut is 
simply cs. The inverse of an element J2 "^wW of the group algebra will be taken to mean ^ r^w~^ . 

It is useful to recall the following formula of Bayer and Diaconis. 



Theorem 1 ([BaL]) The coefficient of a permutation w in the element s is 



1 fn + k-d{w~'^)-l\ 
k^\ n J' 

Theorem derives an analogous formula for a shuffle followed by a cut. 

Theorem 2 The coefficient of a permutation w in the element cs is 

1 fn + k-cd{w~^) -l\ 
nk"^^^ \ n — 1 J 

Proof: Consider instead the coefficient of w in s^^c. This coefficient is equal to 



-| 71—1 

- y Coeff. of wQ^ in s'^ . 

The element wC, maps i io w{i + k mod n). Consequently letting cd{w) be the number of cyclic 
descents of w, there are cd{w) values of k for which wC^ has cd{w) — 1 descents, and n — cd{w) 
values of k for which wC,'' has cd{w) descents. Combining this with Theorem ^ shows that the 
coefficient of w in s~^c is 

1 / ,/ sfn + k-cd(w)\ . ,, ..fn + k-cd(w)-l\\ 

which simplifies to the formula in the statement of the theorem. □ 
This yields the following combinatorial corollary. 

Corollary 1 Let B^^i he the number of elements of Sn with i cyclic descents. Let A^^i he the 
number of elements of Sn with i — 1 descents. Then 

1 ™n-l _ Y^ -Bn.i in+x-i-l\ 

-'• X — 2^l<i<n~l^r\ n~l )■ 

2. If n > 1 then Bn^i = nAn-i^i. 

Proof: The first assertion is immediate from Theorem ||. The second assertion follows from the 
first together with the well-known facts that An^i = An^n+i-i and that An^i is the unique sequence 
satisfying Worpitzky's identity x"^ = Y.i<i<n ^uA'^^'n^) ■ ° 

Theorem ^ appears in []Ce3|| for the case k = 2. As noted there, it implies that (cs)^ = cs^ for 
any natural number h. The proof given here is simpler. 

Theorem 3 esc = cs. 

Proof: Taking inverses and using the fact that c~^ = c, it is enough to show that cs~^c = s~^c. 
The coefficient of w in c,s~^c is 

n-l 



— y Coeff. of C w in s~ c. 
It is easy to see that cd[( w) = cd{w) for all k. The result now follows from Theorem H □ 



10 



Next recall the notion of total variation distance ||Pi — P2II between two probability distributions 
Pi and P2 on a finite set X. It is defined as 



I E l^i(^ 



P2{x) 



x£X 



The book |D2(| explains why this is a natural and useful notion of distance between probability 
distributions. Pi * P2 (the convolution) is defined by Pi * P2{'^) = JlreSn Pii'^'^^^)P2{T), and 
Pi * ■ ■ ■ * Pj^ is defined inductively. The following elementary (and well known) lemma will be 
helpful. 

Lemma 1 Let P, Q be any measures on a finite group G and let U he the uniform distribution on 
G. Then \\P*Q-U\\ < \\Q - U\\. 

Theorem ^ shows that cuts do not speed up the convergence rate of riffle shuffles. 

Theorem 4 1. Let S^ ' , C, U denote the probability distribution corresponding to a k-riffte shuf- 
fle, a cut, and the uniform distribution respectively. Then \\G * S^^' — U\\ < \\S^^' * C — U\\ 
and the inequality can be strict. (In words, a shuffle followed by a cut is more random than 
a cut followed by a shuffle). 

2. Forn>l, \\C * S'^''^ - U\\s,, = \\S^''^ - U\\s„_,. 

3. Let W be the convolution of any finite sequence of riffle shuffles and cuts. Let W' be the 
convolution of the same finite sequence, but with the cuts eliminated. (By abuse of notation, 
these can be viewed on any symmetric group). Then 

\\W-U\\s„>\\W'-U\\s„_,. 

Proof: For the first assertion, observe that Theorem || gives that C * S**^*^^ = C * S^^"^ * C. Now 
use Lemma |^. Computations with the symmetric group S4 show that the inequality can be strict. 
For the second assertion, let B{n, i) be the number of elements of Sn with i cyclic descents and 
let A{n, i) be the number of elements of Sn with i — 1 descents. Observe that 



\C*S^^^ -u\\ 



. n-1 

l,)L.B{n,i) 


/-/£+ 


n—i—\\ -. 
n-1 ) J- 




n/c'^-1 n! 




^ i=i 


/fc+n— j— 1\ 
V n-1 ) 


1 

(n-1)! 


l|5('^-c/|k 


-1- 
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The second equality is the second part of Corollary ^ and the final equality follows from Theorem 

0- 

For the third assertion, the inequality is clear if W has no cuts. Otherwise, combining the fact 

that S^^' * S^^' = £'(*•'-' for any i,j with Theorem |3| shows that W is equivalent to a convolution of 
the form S^'^'^' *C * S^^'^' (with ki or k2 possibly and S^^' denoting the measure placing all mass 
the identity). Now observe that 

WS^'^^K C * S'^''^'> - U\\s„ > ||C7*5('=i)*C*5('^2)-C/||s„ 

= \\C * S^''^''^'^ - U\\s^ 
= \\s('^>'-)-U\\s^_,. 

The first equality is Lemma |l], the second equality comes from Theorem ^, and the third equality 
is the second part of this theorem. D 



A formula for a cut followed by a riffle shuffle appears in [BaD], though it is not evident how it 
could be used to prove part 1 of Theorem ^. 

As a final problem, we observe that the n-cycle C = (1 • • • 't-) is a minimal length Coxeter element 
for type A. As there are analogs of shuffling for other finite Coxeter groups [BeBe|,[F2], it may be 
possible to extend the results of this paper to other Coxeter groups. 

4 Representation theory 

This section uses representation theory to obtain a formula for the cycle structure of a riffle shuffle 
followed by a cut. 

It is useful to recall the notion of a cycle index associated to a character of the symmetric group. 
Letting ni{w) be the number of z-cycles of a permutation w and A^ be a subgroup of Sn, one defines 

Zn{x) as 

^Mx) = ^ExMn«r^"^- 

The cycle index stores complete information about the character x- For a proof of the following 
attractive property of cycle indices, see |Fe]. 

Lemma 2 Let N be a subgroup of Sn and x o class function on N . Then 



12 



^5„(/n4"(x)) = ^iv(x). 

Next, recall that an idempotent e of the group algebra of a finite group G defines a character x 
for the action of G on the left ideal KGe of the group algebra of G over a field K of characteristic 
zero. For a proof of Lemma y, which will serve as a bridge between representation theory and 
computing measures over conjugacy classes, see [Han]. For its statement, let e < u) > be the 
coefficient of w in the idempotent e. 

Lemma 3 Let G he a conjugacy class of the finite group G, and let x be the character associated 
to the idempotent e. Then 

1 

I 
wec wgc 

It is also convenient to define 



1^1 Y. ^H = 51 e < ^ > • 



w£S„ i 

which makes sense for any element e of the group algebra. Note that one does not divide by the 
order of the group. When e is idempotent and x is the associated character. Lemma ^ can be 
rephrased as 

ZsAx) = Zs^ie). 
To proceed recall the Eulerian idempotents e;^, j = 1, • • • ,n in the group algebra QSn of the 



symmetric group over the rationals. These can be defined | GerS | as follows. Let Si^n~i = J2'^ 
where the sum is over all (") permutations w such that w{l) < • • • < w{i), w{i + !)<•••< w{n) 
and let s„, = Y^^Zi Si^n-i- Letting /ij = 2-' — 2, the e^ are defined as 



e^ 



n 



fj-i 



They are orthogonal idempotents which sum to the identity. 

The following result, which we shall need, is due to Hanlon. The symbol /x denotes the Moebius 
function of elementary number theory. 

Theorem 5 ( ^ar^ ) 

CO n 
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Theorem 6 f^J) 

« = 1 weSn \ / 

Remark: Combining Lemma ^ and Theorem |6|, one sees that the formula for the cycle structure 
of a riffle shuffle [ DMP | and Theorem g imply each other. It is interesting that both proofs used a 



bijection of Gessel and Reutenauer | GesR | 



To continue, we let en denote the idempotent obtained by multiplying the coefflcient of w in e;^ 
by sgn{w). Let A„+i be the n + 1 cycle (1 2 • • • n + 1) and A„+i = ;^ Y.7=ois9nXn^i)Xn+i. Viewing 



en as in the group algebra of Sn+i, Whitehouse HWhfl proves that for j = 1, • • • , n the element A^+iCn 



is an idempotent in the group algebra QS'^.+i, which we denote by f^+i- Whitehouse's main result 
is the following: 



Theorem 7 ([Wh]) Let F^^i,En be the irreducible modules corresponding to the idempotents f„+i 
and ell. Then 

As final preparation for the main result of this section, we link the idempotent An+ien with 
riffle shuffles followed by a cut. 

Lemma 4 The coefficient of w in YJj=ik^K+iel is sgn{w)^{^^'''^'^^'^^) . 

Proof: Given Theorem g this is an elementary combinatorial verification. □ 

Theorem ^ now derives the cycle structure of a permutation distributed as a shuffle followed by 
a cut. So as to simplify the generating functions, recall that X^dli ^('^) vanishes unless i = 1. 

Theorem 8 

^ V- 1 (n + k- Cd{w)\ yr n,{w) 

^ V (n + l)A:«+M n M.^ * 

k-l k k-l\}, k^' 

If k = q is the size of a finite field, this says that the cycle type of a permutation distributed as a 
shuffle followed by the cut has the same law as the factorization type of a monic degree n polynomial 
over Fq with non-vanishing constant term. 

14 



Proof: Replacing Oj by ajA;*(— l)*"^ , it is enough to show that 



1 + E E ^Sn{w)j-^i jna/^ 

n>lweSn+l ^ ' ' \ / I 






Using Lemmas §, P, Q and Theorem 0, one sees that 



1 , v^ v^ I \ 1 (n + k- cd{w)\ ^j ndw) 

1 + E E «^-H(;rTT)i n jn«/^ 



oo n 



= i + EE^'^5.+i(/i^) 

n=lj=l 
oo n j oo n j 

= 1 + E E ^'' E ^^.+1 (^"4r ^ (<)) - E E ^' E ^^.+i (<^) 

ra=lj=l i=l n=lj=l i=l 

oo " J,n+1 _ p. oo n , „+i _ , j 

= i + «iEE^^.«)(^T3r-)-^^^^"-(<+i)^\3r-) 

n=l i=l n=l i=l 

T -. oo n 1 oo n 

= l + aifc^5.(eT) + ^f^E^"E^^.(^) + ^EE^^^s.(<)- 

n=2 i=l n=2 i=l 

To simpHfy things further, recah that J27=i ^s„i^n) is a" since the e^'s sum to the identity. The 
above then becomes 

1 1 oo n 

1 - fcTT - «i + 1^(1 + E E ^^^^.(^))' 

n=l i=l 



so the sought result follows from Theorem |5l 



□ 



Before continuing, we observe that a combinatorial proof of Theorem g (which must exist) would 
give a new proof of Theorem ^, by reversing the steps. 

Upon hearing about Theorem ^, Persi Diaconis immediately asked for the expected number of 
fixed points after a fc-riffle shuffle followed by a cut, suggesting that it should be smaller than for 
a k riffle shuffle. Using the methods of Section 5 of DMP||, one can readily derive analogs of all of 



the results there. As an illustrative example. Corollary |2| shows that the expected number of fixed 
points after a /c-riffle shuffle followed by a cut is the same as for a uniform permutation, namely 1 
(the answer for /c-riffle shuffles is 1 + 1/k + • • • + l/k^~^). Two other examples are worth mentioning 
and will be treated in Corollary |3|. 
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Corollary 2 The expected number of fixed points after a k-riffle shuffle followed by a cut is 1. 
Proof: The case n = 1 is obvious. Multiplying aj by u in the statement of Theorem shows that 



^ V {n+l)k-+A n jl.i * 

k-i k k-lK}:^ k^ ' 

To get the generating function in u (for n 7^ 1) for the expected number of fixed points in a 
riffle shuffle followed by a cut, one multiplies the right hand side by k, sets 02 = 03 = • • • = 1, 
differentiates with respect to ai, and then sets ai = 1. Doing this yields the generating function 

-M + ii J_J_(1 - -j) ' ^d\zf^y ' 
k 



The result now follows from the identity 



hi' 



k^ \ — u 



which is equivalent to the assertion that a monic degree n polynomial over Fq has a unique factor- 
ization into irreducibles, since ^/iJ2d\i l^i.^)^^ is the number of irreducible polynomials of degree 
i over the field F^. □ 

Corollary 3 Fix u with < n < 1. Let N be chosen from {0, 1,2, •••} according to the rule 
that N = with probability y^j^ and N = n > I with probability ^ ~ k-Z ■ Given N, let 
w be the result of a random k shuffle followed by a cut. Let Ni be the number of cycles of w of 
length i. Then the Ni are independent and Ni has a negative binomial distribution with parameters 
1/i J2d\i l^i^) (^* ~ 1) o.iT'd {u/ky. Consequently, for fixed k as n ^ 00, the joint distribution of the 
number of i cycles after a k-shuffle followed by a cut converges to independent negative binomials 
with parameters ^/iJ2d\il^i'^)(.^^ ~ ^) ^'^'^ i^/^Y- 

Proof: Theorem and straightforward manipulations give that 



fc- 1 ^ >-^ n" fn + k- cd{w) - l\ tt n,{w) 
fc ^ c nk^-A n-1 P-^"' 



TT(i _ Q»^ViAE.|.Mrf)(fc'/^-i)^ 



i>l 
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Setting all Oj = 1 gives the equation 



(fc-l)n 
k{l-u) 

Taking reciprocals and multiplying by the first equation gives 






. 1 -^ X (fc- 1)(1 -^) Y^ v^ n" (n+k- cd{w) -Ayr rniw) 
^1-u/k' k-u ^^nA:"-ii n-1 j-^.-'-''* 



J- 1 gilt- 

proving the first assertion of the corollary. 



For the second assertion there is a technique simpler than that in [DMP|. Rearranging the last 
equation gives that 



l-u ^ sr^ sr^ (1 - tt)n" /n + k- cd{w) - l\ -rr n,{w) 



n-l/k' 4:^ ^ n/c"-! V n-1 

' n>lw£Sn \ 

l-l/A: J-Vl-^^ 

Letting g(u) be a generating function with a convergent Taylor series, the limit coefficient of u" in 
Y3^ is simply ^(l). This proves the second assertion. □ 

5 Conjugacy classes 

The aim of this section is to give evidence for the conjecture at the end of Section |2|, in the case 
when gcd{q — l,n) = 1. Note that under this assumption, a uniformly chosen degree n polynomial 
with non-zero constant term and a uniformly chosen degree n polynomial with constant term 1 
have the same chance of factoring into rii i-cycles. Hence in this case the conjecture amounts to the 
assertion that affine shuffles and shuffles followed by a cut, though different probability measures, 
induce the same distribution on conjugacy classes. Before posing a problem which would explain 
why this should hold, some lemmas are needed. 

Lemma 5 If r > 1, then X]f=o Cr{j) = 0. 
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Proof: If I is relatively prime to r, then multiplication by I permutes the numbers {0, 1, ■ • • , r — 1} 
mod r. Thus 

j = 0<l<r j = j = 

a 



Lemma 6 For n > 1, let t be the largest divisor of n such that gcd{cd — l,t) = 1. Suppose that 
gcd{n, q — 1) = 1 and that r divides n and q — cd. Then r divides t. 

Proof: Observe that gcd(r, cd — 1) = 1. For suppose there is some a > 1 dividing r and cd — 1. 
Then a divides q — cd and cd — 1, hence q — 1- Since a divides r and r divides n, it follows that a 
divides n. This contradicts the assumption that gcd{q — l,n) = 1. □ 

Next we pose the problem of determining whether or not the following statement holds. 

Statement 1: For n > 1, let t be the largest divisor of n such that gcd{cd — 1, i) = 1. Then for 
every conjugacy class C of Sn, the set of permutations in C with cd cyclic descents has its major 
index equidistributed mod t. 

Theorem ^ shows that if Statement 1 holds, then the conjecture about the cycle structure of 
permutations distributed as affine shuffles is correct. Some evidence in favor of Statement 1 is then 
given. 

Theorem 9 Suppose that gcd{q — l,n) = 1. If Statement 1 is correct, then affine shuffles and 
shuffles followed by a cut have the same distribution on conjugacy classes. 

Proof: Suppose that Statement 1 is correct and recall the third definition of affine g-shuffles in 
Section |2[ If g < cd(w) then both the affine q'-shuffle and the g-riffle shuffle followed by a cut 
assign probability to w. If g = cd{w), then the affine g-shuffle assigns probability ^j^rr to w 
if maj{w) = mod n, and otherwise. If g = cd{w), then the g-riffle shuffle followed by a cut 
associates probability — jj-rr to w. Since q = cd, the t in Statement 1 is equal to n, which implies 
that for every conjugacy class C, the set of permutations in C with cd cyclic descents has major 
index equidistributed mod n. Hence Statement 1 holds in this case. 

The third and final case is that q > cd{w). Suppose that r > 1 divides n and q — cd. Lemma g 
implies that r divides t. Hence by Statement 1, for any conjugacy class C , the set of permutations 
with cd cyclic descents has its major index equidistributed mod r. Consequently (the second 
equality below holding by Lemma |^ and the equidistribution property mod r), 
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/n+q—cd{w)—r\ 



E E ;3^rT E ,-IdM jCri-maJiw)) 

cd=l wee ^ r\n,q—cd \ r / 

cd{w)^cd 

n 1 / n+q-cd-r \ 

= E;^7;r3T E K^ E C7.(-majH) 

cd=l ^ r\n,q—cd \ r / -^eC 

cd{w)^cd 

_ ^ 1 /n + q — cd — l\ Y^ 

cd{'w)^cd 

Y^ 1 fn + q — cd{w) — l\ 

From Theorem ^ (the formula for a g-riffle shuffle followed by a cut), Statement 1 follows. □ 

Next we consider evidence in favor of the ideas of this section. Incidentally, given Section |^, 
Proposition |l| confirms Conjecture 1 of [ pq] (in type A) when n is prime and g is a power of n. 

Proposition 1 Suppose that n is prime and that q is a power of n. Then type A affine q-shuffies 
are exactly the same as q-riffle shuffles followed by a cut. 

Proof: The probability that an affine q shuffle yields w is 

/n+q~cd{w~^)—r\ 

—^ E ( q-cd\u.--) )a(-maj(u;-i)) 

r\n,q—cd{w~^) \ r / 

Since 1 < cd{w) < n — 1 for any w in S'„, the assumptions on n and q imply that the only r dividing 
n and q — cd{w~^) is r = 1. The result now follows from Theorem |2[ □ 

Theorem 10 Statement 1 holds for the identity conjugacy class and for the conjugacy class of 
simple transpositions. 

Proof: For the identity conjugacy class, use the third definition of afhne g-shuffles in Section |^, 
together with the assumption that gcd{n, q — 1) = 1. 

Next consider the case of simple transpositions. Suppose that n > 4, the other cases being 
trivial. One checks that all simple transpositions {i,j) with i < j have either 2 or 3 cyclic descents. 
The easy case is that of 2 cyclic descents. The possible values of (z,j) are then (i,i + 1) for 
1 < i < n — 2, (1, n) and (n — 1, n). The values of the major index so obtained are {1, ■ ■ ■ ,n} and 
each value is hit once. Thus Statement 1 holds in this case. 
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The harder case is that of 3 cychc descents. The relevant transpositions are {i,j) with 1 < 
i,j < n and j ^ i + 1 (having major index i + j — 1) and (i, n) with 2 < i < n — 1 (having major 
index i + n — 1). 

First suppose that n is odd. It suffices to prove that J2 ^^dJ) x"^"-^^"^^ "^°'^ " is a multiple of 
^^. Calculating gives 



{n-3)/2 n-i-1 . (n-3)/2 n~2 

y^ ^maj{w) mod n _ V^ ^i V^ ^j _|_ J_ V^ ^i V^ ^j 

w=(i,3) 1=1 j=i+l i=l j=n—i 

n— 3 n— 2 n— 3 



+;^ E -^E-^' + E 

j=(n-l)/2 j=i+l i=l 

/(n-3)/2 (n-3)/2 

y xUx''~' - x*+M + — y xVx"-i - x"-^) 

X — 1 \ ^ x" "^ 

y «=i j=i 

+ — y x^(x"-i-x^+^) + x"~2-x 

X — 1 I x" ,"^^, , 

n - 3 x" - 1 



2 x-1 ' 



as desired. 

Next suppose that n = 2'^ with a > 0. It suffices to prove that Y^ w={i,j) x"^"'^^'^^ "^°'^ " is a 

cd(Tjj)^3 

polynomial multiple of ^ ^_^ . Calculating as above (and omitting the steps analogous to the 
previous computation) gives that 



y ^majMmodn ^ ^_(1 + 3,2 ^ ^4 ^ . . . ^ ^n-2 _ ^ _ ^3 _ ^5 ^n-l^ 

r-\ X-1 

cd('w)^3 

x"-l 



X2-1 

^n/2'' _ I 2;(2"-l)n/2'' _^ ^ ^71/2"^ _^ ]_ 

X — 1 X + 1 

Since n/2" is odd, it follows that x(2"~^)"/2" + • • • + x'^/^" + 1 is divisible by x + 1. □ 

6 Patience sorting 

Having described the motivation in the introduction, we outline and then execute a strategy for 
obtaining generating function information for the first pile size in patience sorting from decks with 
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repeated values. The first step is to apply ideas of Foata to obtain generating functions for multiset 
permutations by the number of cycles. The second step is to give a multiset records-to-cycles 
bijection (generalizing the one used in | |AD[ ), which converts information about the distribution of 
cycles to information about the distribution of records. The final step is to read information off of 
the generating function. 

Some notation is needed. Let a denote the vector (oi, 02, • • •) with Oj > and X) Oi < 00. Let 
Mult{a) denote the collection of all (a'^'^!..) words of length X) ^i formed from Oj z's. 

We recall Foata's theory of cycle structure for multisets pb|, following Knuth's superb exposition 



|Kn|. Suppose that the elements of the multiset are linearly ordered. Then multiset permutations 



can be written in two-line notation 

aaabbcdddd 
cabddabdad 

Foata introduced an intercalation product t which multiplies two multiset permutations a and P 
by expressing a and /? in two line notation, juxtaposing these two-line notations and then sorting 
the columns in non-decreasing order of the top line. For example 

a a b c d abddd = aaabbcdddd 

c a d a b t b d d a d cabddabdad 

Foata proved that if the elements of the multiset M be linearly ordered by the relation <, then the 
permutations vr of M correspond exactly to the possible intercalations 

TT = (Xii • • • Xin^yi)T{x2l ' • • X2n2y2)T ' ' T {xtl ' ' ' XtntVt) 

with yi < y2- ■ ■ < yt and yi < Xij for 1 < j < rii, 1 < i < t. This defines a notion of cycle 
structure for multiset permutations by letting the cycles be the intercalation factors. Let Cj(7r) be 
the number of length i cycles of vr and C(7r) = J2 Ci{TT) be the total number of cycles. Let CKn) 
be the number of i-cycles of vr, where cycles with the same minimum value yj are counted at most 
once. Let C"(7r) = ^^^'^•(vr). For example the multiset permutation 

(431)t(231)t(4) 

satisfies C3(7r) = 2, C^(7r) = 1, C{tt) = 3, and C7'(7r) = 2. 

Proposition ^ gives generating functions for multiset permutations. 
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Proposition 2 



1+E E -^^^^n-r = nr 



^fc" 



a iTGMult{a) i>l fe=l 1~X] >fc ^J 



a:fc 



a -K^Multid) i>l fc=l \ l~IT>fc^ 

Proof: Both generating functions follow easily from Foata's method of representing permutations 
by intercalations. The /c's on the right hand-side index the letters of the alphabet. The point is 
that cycles are formed by fixing a smallest element k and specifying an ordered choice of elements 
larger than k; permutations are ordered multisets of such cycles. □ 

We remark that the generating functions of Proposition |^ are quasi-symmetric functions in the 
sense that for any ii < ■ ■ ■ < in and ji < ■ ■ ■ < jn the coefficients of x^^ ■ ■ ■ x^" and x-^ ■ ■ ■ x^" are 
equal. 



Theorem 11 converts information about the distribution of cycles to information about the 
distribution of records. Some further notation is needed for its statement. Let vr'"'^^ be the word 
obtained by reading from right to left the bottom line in the 2-line form of vr. Recalling the definition 
of Solitaire from the introduction, let Pj(7r) and -P/(7r) be the number of cards in pile i of Solitaire 
with ties allowed and ties forbidden respectively. 

Theorem 11 1. For any given a, there is a bijection <I> : Mult[a) i— > Mult[a) such that if 
Ri,- ■ ■ ,Rt are the positions of the left-to-right minima (ties are allowed) of tt'^^'" , then R2 — 
i?i, i?3 — i?2) ■ ' ' ,Rt — Rt-1, {J2 o-i) + ^ — Rt o,re the cycle lengths in Foata's factorization of 
^(vr). 

2. Pi(7r) = C($(7r)) and P[{-n) = C'{^{-k)). 

Proof: For the first assertion, define $(7r) as an intercalation of cycles formed by entries in the 
bottom line of vr, with cycles (from left-to-right) having lengths (X) CLi) + \ — Rt, Rt — Rt-i, • • • , -R2 — 
Ri. The assertion is then evident, and the following example may help to untangle the notation. 
The multiset permutation ii = ddbcdbbcabacdbd has ti'^'^'" = dbdcabacbbdcbdd 
with fii(7r'''^^) = 1, R2{Tr''^'') = 2, Rsiir''^'") = 5, fi4(vr'^'') = 7, and E a* + 1 = 16. Thus forming 
from vr cycles of lengths 9, 2, 3, 1 gives <I>(7r) as the intercalation 

{ddbcdbbc a)T{b a)T{c d b)T{d). 



22 



For the second assertion, we give the argument for the first equahty, the argument for the second 
assertion being analogous. The point is that the number of cards in pile 1 of Solitaire with ties 
allowed applied to vr is simply the number of left-to-right minima with ties allowed of vr. The result 
now follows from the first assertion. □ 

Proposition H shows that when one considers random words from a finite alphabet, there is a 
factorization for the full cycle structure vector, not only the number of cycles. Recent work of Tracy 
and Widom |TW| ] connects random words with random matrices chosen from the Laguerre ensemble. 
Let Wordn(N) be the A^" words of length n from an alphabet on N letters (say 1, 2, • • • , N). Each 
such word can be viewed as a multiset permutation. 
Proposition 3 

N N 



Z^ ATn 1^ 11^/ 11"/ - 11 11 I -I _ Xfe /v-AT £ixi 

" TT&WordniN) k=l i>l i>lfc=l\-^ "-i N \^j=k+l N J 






ATn Zl^ 11 4 11 11 -. u,x^ ( N~k \i^l ' 

■K&Wordr,{N)i>l i>l k=l ^ N \ N > 



n ■wPWnrd^(N\k=\ i>l i>lk=\ \ ^ 2^j=k+l 



TeWord„{N)k=l i>l i>l k=l \ "^ ^j=k+l N / 

(i--)+E^^^ E n-?^^^=nnTT^=fz=f- 

n TT€Wordn(N) i>l i>l k=l ^ ^ N TZy^ Z 

Proof: For the first assertion, note by Foata's representation of multiset permutations as interca- 
lations that each z-cycle is formed by fixing a smallest element k and specifying an ordered choice 
of i — 1 elements larger than k to occupy the first i — 1 positions of the cycle. Since multiset 
permutations are ordered multisets of such cycles, one concludes that 

N N -, 






n TT&Wordn{N) k=l i>l i>l 

Now replace each Xihy j^. 

To prove the second assertion, replacing each xt by x in the first yields the equation 

1 ^ 1 



1+Ei^ E -"n-r^=nn 



J\J-n A^ 11 « 11 11 , UjX^ ( N-k \i~l 

TTeWord„{N) i>l i>l k=l '^ N ^ N ^ 
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Setting all Uj = 1 and taking reciprocals shows that 

i>ik=i \ ^* ^■^ / 

The result follows by multiplying the previous two equations. 

The arguments for the third and fourth assertions are analogous. □ 

The second and fourth equations have probabilistic interpretations. For instance in the second 
equation, fix x such that < x < 1. The equation then says that if one picks n geometrically 
with probability (1 — x)x'^ and then picks vr G Wordn{N) uniformly at random, the random 
variables Ci^n) are sums of independent geometries. In the fourth equation the C'^ become sums of 
independent binomials. Lemma |7| permits asymptotic statements in the n — > oo limit. 

Lemma 7 If f{\) < oo and the Taylor series of f around converges at u = 1, then 

1 — u 
PROOF:Write the Taylor expansion /(n) = Y.'^=o "-nu"" ■ Then observe that [u^lyr^ = Y.7=o"'i- ^ 

As a corollary, one sees for instance that as n ^ oo, the number of i-cycles of a random length n 
word from the alphabet {1, • • • , A^} converges to a sum of independent geometries with parameters 
(1 — 7y)*~^ ^^ ^ ~ 1) ■ ■ ■ ) A^- For more on this type of factorization result and its applications, 



see pjlS(| for the symmetric groups, |DS| for the compact classical groups, and p^ ] for the finite 
classical groups. 

Finally, we consider the application of Proposition and Theorem O to patience sorting. As 
above, -Pi(vr) and -Pi(vr) be the number of cards in pile 1 of patience sorting with ties allowed and 
ties forbidden respectively. 

Theorem 12 Let tt be chosen uniformly at random from the possible ordering s of a deck of cards 
with Oj cards labelled i. Then EiPi) = Y'i. „ . "^ xr cind E(Pi) = J2h „ _i_ _l'!.'° — r:;-- 

Proof: By Proposition |2| and Theorem 11, 

a TT€Mult{a) i>l fc=l 1-^ .^^ xj 

Differentiating with respect to u and setting u = 1 implies that the sought expectation is 
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n ^ Coeff. ofl[x1' inJ2 . % r. ^^ 

\aua2,-) k ^ - Z. a;, i - 2.^>fc Xj 



1 Y^ y^ / ai H \- a-k-i +bk + &fc+i + 



LuL-^ k:a,>ob,,b,+i,->o\ ai,---,ak^i,bk,bk+i,--- J 

/afc - 1 - 6fc + ttk+i - bk+i -\ \ 

\ ak — I — bk,ak+i — bk+i,- ■ ■ J 

1 V^ I V^ ^ bk IW+i'W+2' 



Letting 5 = 6^ + 6fc_(_i + • • •, this simplifies to 



afc+afc+iH 1 /aj,+a^^^^ ix 



>r^ 1 Y^ /aiH hofc-i + s 



E 



flfc 



^ ai H h fl/fc-i + 1 

The second calculation is similar. □ 

As a final result, we study patience sorting applied to I^n^ the fixed point free involutions in 
the symmetric group S2n ■ By [|Raf| , the number of piles in such a game relates to the eigenvalues 
of random symplectic and orthogonal matrices. Consequently this restricted version of patience 
sorting merits further study. Proposition Q shows that the generating function for the first pile size 
factors. 
Proposition 4 

^ x^iW = n(x2 + 2(i-l)) 

Proof: The proposition is proved by induction, the base case being trivial. Suppose that the 
proposition holds for /2(ra-i)- Given vr E /2n let j be the symbol with which 2n is switched. If 
j ^ 1, then -Pi(7r) is the same as Pi{tt') where vr' is obtained by crossing the symbols j and 2n out 
of vr. If j = 1, then Pi{tt) = -Pi(7r') + 2, where -Pi(vr') is obtained by crossing the symbols l,2n out 
of vr. Consequently, 
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J2 x^'^^'' = (2n - 2) ^ x^iW + ^' E ^""'^"^ 
and the result follows by induction. □ 
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